ollama models

发表于 2025-06-08| 更新于 2025-06-08|20-areasai

阅读量:|评论数:

Ollama Models

ollama run hf.co/sm54/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview-Q4_K_M-GGUF
ollama pull qwen2.5:32b

build\Release\llama-cli.exe -m llama3-70b.gguf —n-gpu-layers 81 -n 4096 —threads 24

g:\project\llama.cpp\build\bin\Release\llama-server.exe -m llama3-70b.gguf —n-gpu-layers 81 -n 4096 —threads 24 —port 8080

Basic web UI can be accessed via browser: http://localhost:8080
Chat completion endpoint: http://localhost:8080/v1/chat/completions

Coding LLM Model Ollama Command

相关推荐

AI local knowledge management

MCP vs Function Call vs Agent for AI

ai_deepresearch_guide

local_ollama_embedded_models

c_plus_plus_teacher prompt

可复现的个人交易策略

评论

本地搜索

由 hexo-generator-search 提供支持